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Information thermodynamics provides a framework for studying the effect of feedback loops on 
entropy production. It has enabled the understanding of novel thermodynamic systems such as 
the information engine which can be seen as a modern version of ‘Maxwell’s Daemon’, whereby the 
feedback controller is acting as a Daemon, processing information gained about the system in order 
to do work. Here, we analyse a simple model of such an engine and provide a detailed analysis of its 
fluctuation properties, including the large deviations of information. We find an exact expression of 
the large deviation rate function for a two-site version of our model, and provide an approximate 
analysis for larger systems which is corroborated by simulation data. 


I. INTRODUCTION 

The groundwork for information thermodynamics was 
laid down by Maxwell as part of his now infamous 
thought experiment ‘Maxwell’s Daemon’ [1]. In the ex¬ 
periment, a sentient agent monitors the motion of ther¬ 
mal particles inside a partitioned container. By operat¬ 
ing a small gate in the partition, fast-moving particles are 
allowed to move to one side of the partition while slow 
moving particles are allowed to move to the other side, 
thus heating up the first side and decreasing the overall 
entropy of the system. 

While first conceived to elucidate the statistical sub¬ 
tleties of the 2'^'^ law of thermodynamics, the experiment 
has since sparked many debates [2-6] on the nature of 
the perceived violation of the 2”^^ law. Note that these 
violations relate to the ensemble average and are differ¬ 
ent from the temporary violations occuring on the level 
of individual trajectories or events [7-9]. 

The culmination of nearly 150 years of discussions 
about Maxwell’s Daemon has been the framework of in¬ 
formation thermodynamics [10], which illuminates the 
profound importance of information in thermodynam¬ 
ics [11, 12]. By including the effect of system memory [13] 
and/or information processing (often thought to be per¬ 
formed by the feedback device, referred to as a ‘Dae¬ 
mon’ [14]), the 2'^'^ law can be reformulated to include 
information entropy and used to study the operation of 
finite-time thermodynamic systems [15] such as informa¬ 
tion heat engines [16]. 

‘Information heat engines’ are a class of thermody¬ 
namic systems that use information processing to do 
thermodynamic work without the need for a change in 
free energy [17, 18]. Methods such as feedback control al¬ 
low the engine to use information gained about a physical 
system to decrease the system entropy and hence extract 
useful work from the system [19]. This does not consti¬ 
tute a violation of the 2'^'^ law as it is understood that 
the operation of the feedback device entails an amount 
of entropy production at least equal and opposite to that 
change in the system [4, 20]. 

A quantitative relationship between entropy and in¬ 
formation is provided by the information thermodynamic 


framework, which gives the universal upper bound on the 
mean negative entropy production that can be obtained 
by feedback control [21]. 

To be precise, the ‘2"'^ law of information thermody¬ 
namics’ states that the entropy production St, of a sys¬ 
tem up to time t is related to the information. It, gained 
by the Daemon via the inequality [22] 

{St)>-{It), (1) 

where the angle brackets denote the ensemble average. 
In fact, this turns out to be a corollary of the generalised 
integral fluctuation theorem [10] (itself a generalisation 
of the Jarzynski equality [23]), 

(e-^‘-^*) = 1. (2) 

this implies that for It ^ 0, 

+ 1, (3) 

that is, the standard integral fluctuation theorem does 

not hold in the presence of feedback [24]. These results 

have been experimentally verified in small systems, where 
thermal fluctuations have a strong influence [17, 25, 26]. 
In previous theoretical studies, the quantity It and its re¬ 
lation to St has been discussed in the context of Langevin 
equations and continuous-time Markov chains [27], and 
the mutual information between the feedback controller 
and the stochastic system has been considered for sys¬ 
tems with discrete events [28]. 

Here, we study an abstract model of an ‘information 
motor’ [28-31]. The information motor discussed here 
is type of ratchet that is able to move a single particle 
against a bias using only the particle’s own random mo¬ 
tion and a feedback mechanism. Specifically, this model 
allows us to demonstrate a method for calculating the 
information It in a discrete-time feedback system and to 
study its fluctuation properties. 

The paper is structured as follows. Sec. II contains 
an overview of the existing information thermodynamic 
framework. In this section we also detail the method 
used for calculating the information gained (in a single 
measurement). In Sec. Ill we describe our simple model 
of a Maxwell’s Daemon type feedback system. In Sec. IV 
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we discuss the fluctuations of information on the level of 
individual trajectories and how to obtain large deviation 
rate functions. In Sec. V we obtain exact expressions for 
the large deviation rate function in a two-site version of 
our model. In Sec. VI we then give a detailed approxi¬ 
mate analysis and obtain numerical results for larger in¬ 
formation engines. In Sec. VII we conclude by summaris¬ 
ing our results and discussing potential further work and 
open questions. The appendices contain further details 
of the two-site system and properties of individual tra¬ 
jectories. 


II. FRAMEWORK 

For simplicity’s sake, let us consider a stochastic sys¬ 
tem evolving in discrete time and having states in some 
finite state space of size L. The state of the system at 
a time s is represented as a random variable with a 
specific realisation denoted by a:s. A trajectory of the sys¬ 
tem is written Xt = {Xs}*_g with a specific realisation 
denoted by Xj. The probability of a transition between 
states X and x' is written as l>j{x x'). 

For a system subject to general feedback, we consider 
that the transition probabilities are determined by some 
other parameter referred to as the control parameter. In 
purely ‘open-loop’ control, the control parameter is in¬ 
dependent of the system state. However, in the case of 
‘closed loop’ or ‘feedback’ control, the system’s evolu¬ 
tion influences the control parameter in a closed causal 
loop [32]. 

In the case of Maxwell’s Daemon, the Daemon is identi¬ 
fied as a feedback controller [22] whose activity can be de¬ 
scribed by two processes, measurement and control. The 
measurement is the process that allows the controller to 
select a control parameter to ‘feed’ back into the system 
via the control process as described above. The measure¬ 
ment is represented in a similar fashion to the system 
trajectory, and is written as Yt = {Vs}s=o- The mea¬ 
surement Ys at time s only depends on the current state 
Xa and so we denote, 

PiUs I Xs) := P [Ys = Us \ Xs = a:^] (4) 

as the probability of obtaining outcome i/s given that 
the system is in state Xs- Here we assume an injective 
mapping between measurement outcomes and control pa¬ 
rameters, that is, a given measurement ys determines a 
unique control parameter and thus along with the depar¬ 
ture state Xs determines the probability of transitions to 
the next state a^+i; we write this as 

uj{xs ^ Xs+i\ys)- (5) 

The conditional distribution in (4) is derived from P, 
the path space measure of the full process {(Xg, Ys)}*^g. 
This process is a Markov chain on the state space given 
by X and Y pairs and can be described by the transition 
matrix 

n{{x, y) (Y, y')) := uj(x x' \ y)p{y' \ x'). (6) 


We also write P{xs) = P [Xg = Xg] and P{ys) = 
P [Yg = yg] for the marginal distributions of the process 
and measurement trajectories respectively. Note that, 
while the measurements Yg are conditionally independent 
given the path Xt, the marginal measurement process Yt 
exhibits correlations after integrating out Xf and is not 
a sequence of independent identically distributed (i.i.d.) 
random variables. Extending the definition in [33], the 
entropy production at time s as a function of a given 
trajectory (xf,yt) in a feedback system is 


wfccg+i Xs I ys) 


(7) 


As mentioned above, for a system with feedback it 
is also necessary to quantify and study the information 
gained through measurement. When considering the in¬ 
formation gained in a single measurement, we follow [34] 
and use the ‘change in uncertainty’, an information the¬ 
oretic quantity that quantifies the information gained 
upon making an observation of some process. For a single 
measurement this is given by 


A/g = In 


= In 


PjVs I Xs) 

P{Vs I ys-iV 
pjVs I Xs)P{ys-i) 

P{y.) 


( 8 ) 


for all s > 1, where 


P{ys I Ys-i) = P [Ys=ys I Yo=yo, •. ■, Yg_i=j/g_i]. (9) 

Defining 7ro(a::o) = P [Xq = xq], the denominator term 
in (8) is given by 

PiYs) [X^+i = Xg+i, Yg = yg], 

X^ + l 

s 

= ^ 7ro(a;o) w(a;„ Xu+i \ y«)p(r/« | Xu). 

Xs + I u—0 

( 10 ) 

We can evaluate (10) by writing the sum as a matrix 
product, representing the terms in the sum as the ele¬ 
ments of matrices 


:= u){x x' 1 ys)p{ys \ x). (11) 

We write the initial probability distribution as a vector 
(ttoI = (7ro(l), 7ro(2),..., 7ro(T)) and also define a sum¬ 
mation vector of length L, [1) = (1,1,..., 1)^. We can 
then write P{yt) as 


t 

P{yt) = {7To\l[^y^\^^- ( 12 ) 

s=0 

For any given series of measurements yt we can calcu¬ 
late P{yt)- Each of the matrices has a size Lx L and we 
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must multiply t of these matrices to calculate the infor¬ 
mation obtained up to time-step t. Eq. (8) can then be 
written as 




\np{ys I Xs) 


(^oinrio 

(^oim=o 


My, |1) 


(13) 


We will see in Sec. V that this representation of A/s also 
clarifies potential cancellation of terms in the products. 

Here we are interested in the fluctuations of the to¬ 
tal information gained It , which is obtained by summing 
terms (8) along a given trajectory (xt,yt). This yields 




s=0 


pjys I Xs) 
Pivs I ys-i)' 


= In 


Pjyt I xQ 

Piyt) 


(14) 


Here we have used the conditional independence of the 
measurements P(yt | xt)= | Xs), defined y_i 

such that P{yo \ y_i) = P{yo) and used P{yt) = 


III. MODEL 



FIG. 1: Schematic of model. The feedback mechanism 
makes a measurement of the particle position and 
places a barrier according to the outcome of the 
measurement. Ifj/ = a;or?/ = a:-|-l then the barrier 
influences the particle movement by preventing certain 
jumps as in the top and middle right. For all other 
measurement values, the barrier has no effect and the 
particle moves freely as in the bottom right. 


We consider a model that abstractly resembles a recent 
experimental set-up involving a colloidal particle rotat¬ 
ing in an electric field [17]. The experimental system 
demonstrated a type of particle ratchet where a field can 
be switched and shifted along with the motion of the par¬ 
ticle in order to block it from moving. By ratcheting in 
this way, the particle’s own thermal motion can be used 
to do work. 

Our model is comprised of a random walk on a one 
dimensional lattice with a movable barrier. The random 
walk acts as an analogy to the colloidal particle in the ex¬ 
periment, with the walker’s random motion modeling the 
thermal motion of the experimental particle. We model 
in discrete space, as in the real experiment there was a 
single coarse-grained measurement, essentially allowing 
the identification of discrete ‘states’. 

Furthermore, the measurements were performed at 
regular intervals which allows the whole feedback process 
to be thought of in discrete time-steps. In our model, the 
random walker moves between sites on a lattice of size 
L with periodic boundary conditions and probabilities q 
and p of jumping respectively left and right at each time- 
step, such that p + q = 1. The random walker’s motion 
is then described by a single parameter p, which in the 
case where p ^ q describes a system with a bias in one 
direction. Without loss of generality, we consider q > p 
for biased systems. We label jumps left as ‘down’, and 
jumps right as ‘up’ as though the particle were moving 
in a potential. 

In the spirit of the feedback process described in [17], at 
each time-step a measurement is made of the particle po¬ 
sition and the barrier is moved to the measured location 


of the particle in an attempt to prevent it from moving 
down. When the particle attempts to jump ‘through’ the 
barrier, it instead remains at the same site. If the mea¬ 
surement is always correct, the particle can only ever 
jump up and so the system acts as a perfect ratchet. 
However, if the measurement is incorrect then the bar¬ 
rier will be placed incorrectly and will either not affect 
the particle’s motion or will act as a blockade for jumps 
up. The system is initialised by first choosing a site Xq 
uniformly from the lattice sites and then performing the 
measurement process to obtain a yg. Fig. 1 is a schematic 
diagram of the system where the bias is represented by 
showing the lattice as a staircase. The three possible 
results of the action of the feedback device are shown, 
including the situation where the feedback has no effect 
on the particle motion. 

The model can be described by three parameters: the 
lattice size L, the motion bias p and the measurement 
accuracy r which we define as 

r := p(x I x) V X € {1,... ,L}. (15) 

The corresponding error probabilities 

w:=p(ylx) V y,x G y ^ X, (16) 

are related to r via the normalisation condition w = 
(1 — r)/{L — 1). The measurement error is then inde¬ 
pendent of which site x the walker is at and all incorrect 
measurements (i.e. any y ^ x) are equally likely. 

We consider the case of ‘accurate’ measurements (r > 
w) as in this regime the measurements can be used to 
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FIG. 2: Numerical tests of (2), (3) and (18) in semi log 
scale. Parameter values are L = 3, p = 0.2, r = 0.9. 
Averaged over 10^ realisations. Error bars indicate the 
standard error of mean of at representative 

points. 


make useful inferences about the system state and the 
information gained through measurement can be used in 
the operation of the information engine. Contrastingly, 
in the special case r = w = 1/A, the joint probability 
in (10) factorises and (14) is always zero; no information 
is ever gained by the Daemon and hence the system re¬ 
duces to a ‘lazy’ random walk. When r < w it is possible 
for the Daemon to gain information, but we do not study 
this regime as it is not clear in general how to utilise 
the information gained from a measurement device that 
measures wrongly more frequently than correctly. 

These three parameters, p, r and L completely char¬ 
acterise the model. At each time-step the contribu¬ 
tion to the entropy production is given by (7) and is 
G {— In ip/q), 0, In (p/q)}, where the non-zero terms 
correspond to successful jumps down or up and 0 is 
the entropy produced if the particle attempts to move 
through the barrier. The information gained by the feed¬ 
back controller is given by (13). All three parameters 
determine the average particle current 

= pr + pw{L — 2) + qw{l — L), (17) 

where positive current is in the rightward direction. Our 
choice of parameters {q > p and r > w) can produce a 
positive average current (current against the bias), when 
in the absence of feedback, zero or negative current would 
be expected. 

As a preliminary to Sec. IV, we numerically check the 
generalised integral fluctuation theorem (2), show that 
the standard integral fluctuation theorem (3) does not 
hold, and check the equality, 

(e-^*) = 1, (18) 

which follows from the definition (14). Fig. 2 confirms 
these (in)equalities at different times t with numerical 
data obtained from Monte Carlo simulation of a biased 


three-site system. Since the means of exponential quanti¬ 
ties are determined by rare events, fluctuations are large 
as indicated by the error bars on representative points. 


IV. INFORMATION FLUCTUATIONS AND 
LARGE DEVIATIONS 

A. Fluctuations of AA 

Having numerically checked the (2), (3) and (18), we 
now proceed to investigate in more detail the fluctua¬ 
tion properties of the information gained by the Daemon. 
The variables It/t and AA are of interest, being the time- 
averaged information gain and instantaneous change in 
uncertainty, respectively. On the level of an individ¬ 
ual trajectory there are three types of events possible, 
given the system dynamics, that influence AA. These 
are roughly described as follows, 

• Correct measurements 

• Incorrect measurements that are recognisable as 
such 

• Incorrect measurements that are not recognisable 
as such 

We observe from the analysis of trajectories in App. A 
that correct measurements yield positive amounts of in¬ 
formation. Correctly observed jumps against the bias 
yield more information than blocked jumps in the direc¬ 
tion of the bias. For symmetric systems the difference 
in AA between jumps left and right is small. Series of 
correct measurements yield one of two baseline values for 
information gain AA that correspond to jumps up and 
blocked jumps down (see App. A). 

If an incorrect measurement is made, then the ratio 
P(ys-i)/P(ys) in Eq. (8) changes and the amount of 
information gained is zero if the current measurement is 
not compatible with the previous measurement (e.g. the 
particle looks like it has jumped ‘through’ the barrier 
or jumped more than one site). A sequence of incorrect 
measurements can lead to negative values in information 
gain, interpreted as a change towards greater uncertainty 
of the system state. 

Whenever a series of incorrect measurements is made, 
the next series of correct measurements gains large 
positive amounts of information that partially retrieve 
the ‘lost’ information of the incorrect measurements. 
As these correct measurements are made, the ratio 
P(ys-i)/P(ys) relaxes and AA returns to a baseline 
value which we explain in detail in App. A. The infor¬ 
mation theoretic interpretation is that successive correct 
measurements allow the observer to infer which measure¬ 
ment was incorrect. However, for consistent incorrect 
measurements which are not detectable the information 
lost cannot be retrieved. A discussion of this with an 
example is given in App. A. 
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Large positive deviations of It/t are not generated by 
an accumulation of the largest values of A/g as these are 
necessarily preceded by large negative values as described 
above. They are instead generated by strings of correct 
measurements which each generate less information than 
the largest values of A/g. 

In contrast, large negative deviations are generated by 
sequences of incorrect measurements which happen to 
represent a possible system trajectory. In this case large 
negative values of A/g can accumulate and are not com¬ 
pensated by subsequent large positive values. In general, 
since the baseline values discussed above depend on the 
trajectory in case of correct measurements, atypical tra¬ 
jectories X( also play a role in the realization of large 
deviations of It/t. 

B. Large Deviation Analysis 

In order to study the fluctuation properties of It/t, 
we follow standard methods (see, e.g. [35]) and start by 
assuming that It/t obeys a large deviation principle of 
the form, 

(19) 

as t approaches infinity, where E(i) is the ‘large deviation 
rate function’ [36]. The rate function tells us about the 
fluctuation properties of the variable /t/t, and allows us 
to quantify how exponentially unlikely a given fluctuation 
away from the mean is, in the long-time limit. 

In order to calculate the rate function, we first consider 
the scaled cumulant generating function (SCGF), 

^(fc) := lim --lnG(fc), (20) 

>-oo t 

where G{k) is the moment generating function, 

/ oo 

e-'=“P[/t = rt]du. (21) 

-OO 

The rate function is then the Legendre-Fenchel transform 
of the SCGF 

E{i) = sup [^{k) — ki]. (22) 

fcGR 

By rewriting an expectation of the form (21), the 
SCGF can often be obtained as the logarithm of the prin¬ 
cipal eigenvalue of the Markov transition matrix which is 
weighted to count the relevant quantity [37]. This ap¬ 
proach can be applied if the measured quantity depends 
additively on transitions along a system trajectory, as is 
the case with particle current. Equation (8) shows that 
this is in general not the case for information, since the 
gain in a given measurement depends on the entire mea¬ 
surement history up until that point. Hence transitions 
on the enlarged state space of pairs (xg, 2 /g) cannot be 
associated with specific values of A/g. 


However, in the next section we show that for L = 2, 
A/g can be simplified using Eq. (13) to an expression 
that only depends on the departure and target states 
in a single transition, and the above approach can be 
applied by weighting the transition matrix (6) to count 
information gain leading to an exact analytical expression 
of the large deviation rate function. This simplification 
does not hold for L > 3, and in Sec. VI we describe 
approximate methods for obtaining the rate function by 
formulating a one-step Markov model for the sequence of 
A/g. 

V. EXACT COMPUTATION FOR TWO-SITE 
SYSTEM 

For a system with two sites labeled 1 and 2, the ma¬ 
trices used in (12) are 

pG (23) 

y qw pw J ^ ^ 

/pw qw\ 

\ pr qr J ' ^ 

corresponding to the two measurement outcomes. For 

this system, the My matrices for any L are similarity 

transforms of one another and have the same spectrum. 
For the L = 2 case these matrices have only one non-zero 
eigenvalue A = qr+pw, allowing them both to be written 
as tensor products on the eigenspace of the corresponding 
eigenvector. That is, we can write My = \vy) {Vy\ where 
{Vy\ and \Vy) are the left and right eigenvectors of My 
with respect to A. The matrices can be rewritten as 

Mi = \v{){v[\=(^l'^{q p) (25) 

and 

Ma = ju^) (u^l = ^ ^ ^ (p (? ) . ( 26 ) 

Writing the matrices in this way shows that all but the 
final term of the upper product in (13) cancel with terms 
in the lower product, leaving an inner product between 
two vectors: 

P{ys) _ (tToI MygMy.^ . . . My^_.^My^ [l) 

P(yg_l) {'KQ\My„My^...My^_^\l) 

(ttoI - 11) 

= «-JO- (27) 

P(yg)/P(ys_i) then can take four values corresponding 
to the values taken by ys-i and ys, i.e., ys-i,ys G {Ij 2}. 
However, as the two My matrices are permutations of one 
another, only two distinct values can be obtained, when 
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FIG. 3: Cumulative density function for Alg for 
different system sizes L with p = 0.2 and r = 0.9. 
Dashed lines give theoretical values for L = 2 as given 
by (29) and(30). 



FIG. 4: Large deviation rate function (22) for L = 2, 
p = 0.2, and r = 0.9. The solid black line shows the 
theoretical curve obtained from Eq. (34) and (22). 
Points represent data sampled at different finishing 
times t. 


Us 7 ^ j/s-i or when = ys-i- The change in information 
upon making a measurement in this two-site system is 


Alg = In 


PjVs I Xs) 


(28) 


which for a given Xs depends only on the previous and 
current measurements ys-i and j/g. From (28) it can 
further be deduced that the process {A/s}‘^g is simply 
a sequence of i.i.d. random variables. This simplification 
only holds for L = 2 and is demonstrated in App. B. 

As p{ys I Xg) has two possible values and | VyJ 

also has two possible values, Alg takes four possible val¬ 
ues. Specihcally, these are 

In- ;= a, In- := b, (29) 

pr + qw qr -\- pw 

for correct measurements made after jumps in the up and 
down directions (whether blocked or not), respectively. 
For the same cases followed by incorrect measurements, 
Alg takes the values 

w w 

In- - - := c. In- - - := d. (30) 

pr + qw qr + pw 


The values taken by Alg can be associated with tran¬ 
sitions on the state space {xs,ys), which is described by 
the transition matrix (6). For L = 2 this matrix is 

( qr qw pw pr 
pr pw qw qr 
qr qw pw pr 
pr pw qw qr 

The values in (29) and (30), along with the probabilities 
of these events occurring, given by the transition ma¬ 
trix (31), are enough to determine the cumulative density 
function (GDF) of the random variable Alg for all s > 1 

GDFA/(Ai) := P [AI < Ai] V Ai e R, (32) 



which is plotted in Fig. 3 and compared with numerical 
data. 

It is possible to weight the Markov transition ma¬ 
trix (31) with the values of Alg from (29) and (30) to 
obtain the tilted matrix. 


n'{k) 


/ e~^^qr e~‘^^qw e~‘^’^pw e~°‘^pr \ 

e-°‘^pr e~'^^pw e~'^^qw e~’^'^qr 

g-bkqr g-cfegy; Q-dkpyy ^-akpj, 

y e~°'^pr e~'^^pw e~'^^qw e~^^qr j 


(33) 


which has principal eigenvalue 

A(/c) =e-“V + -b e-^'^qw -b e-’^^pw. (34) 


The logarithm of (34) is taken as the SCGF ^(fc) and Leg¬ 
endre transformed according to (22) into the rate func¬ 
tion Eii). Fig. 4 shows this rate function plotted with 
data from simulation for a two-site system. In the long¬ 
time limit the data converge well to the rate function. 


VI. APPROXIMATION FOR LARGER 
SYSTEMS 

A. General behaviour 

For systems with three or more sites, the process 
{(As,Yg)}g^g is still a Markov chain with a stationary 
state. However, unlike the L = 2 case, the information 
gained in each measurement along a trajectory {A/s}‘_g 
is not a sequence of i.i.d. random variables. Eq. (13) 
cannot be reduced to a simpler form as the matri¬ 
ces for L > 3 have more than one non-zero eigenvalue, 
and cannot in general be written down in a form that al¬ 
lows cancellation like (27). Indeed, {A/s}‘^g is also not 
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FIG. 5: A scatter plot of A/g+i against Alg for L = 10, 
p = 0.5, r = 0.9 to illustrate correlations as explained in 
the text. Histograms on the axes show the density of 
points on the plot. 


a Markov chain because each value depends on the en¬ 
tire trajectory (x^, yg) up to that point, which is a larger 
object at each successive value of s. 

From simulation we that the information A reaches 
linear growth after some initial transient, and so we ex¬ 
pect the information increments AJg to also converge to 
a stationary distribution. As the matrices used to cal¬ 
culate A/g all have principal eigenvalue < 1 (for 

all parameters except 1 — p = r = 1), the system should 
exhibit an exponential decay of correlations. This allows 
us to assume that, at long times, A/g only has significant 
dependence on a finite number of the previous measure¬ 
ment outcomes / events. 

We numerically investigate the behaviour of A/g for 
systems with A > 3, specifically we focus here on results 
for L = 10. Other L values (larger and smaller) do not 
significantly differ in their general behaviour or numerics. 
Fig. 3 shows the CDF for A/g, for various system sizes 
up to A = 20. The shape of the function and the position 
of the minimum, maximum and most likely intermediate 
values do not vary significantly. The scaling of the most 
likely, minimum and maximum of A/g with A is detailed 
in Appendices A and C. 

Fig. 5 shows a scatter plot of A/g+i against A/g for 
A = 10. Independent random variables plotted this 
way would produce a symmetric cloud or grid of points. 
However this plot features diagonal patterns which cor¬ 
respond to correlation between the two variables. The 
projected probability densities are shown as histograms 
along the axes. The histograms suggest that A/g and 
A/g_|_i are identically distributed as expected. To verify 
this and understand how successive values of A/g are cor- 
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FIG. 6: Sample autocorrelation function of A/g (35). 
Shows significant negative correlations after one time 
step. 95% confidence intervals plotted as blue dashed 
lines. A = 10 and r = 0.9 in both cases. 


related, we compute the sample autocorrelation function 
(AGF) defined as 


ACFa/,(t) 


- {AimAh+r - (AI)) 
(t-T)ELi(A/.-(A/))2 

(35) 


Fig. 6 shows the ACF of AI after an initial transient 
period for biased and unbiased cases with 95% white- 
noise confidence intervals. The autocorrelation functions 
show significant negative correlation between A/g and 
A/g_|_i, but beyond this no significant correlation. That 
is, the information gained at successive time-steps is anti¬ 
correlated. This is because when an incorrect measure¬ 
ment is made the next measurement is likely to be cor¬ 
rect, as correct measurements are more probable, and 
the amount of information gained will be positive (see 
App. A). 

While successive correct measurements do each con¬ 
tribute positive amounts of information, they do not dif¬ 
fer as radically as the change from a negative to positive 
amount of information. Fig. 6 also suggests that biased 
systems are less strongly anti-correlated than unbiased. 
In the next subsection we use the one time-step correla¬ 
tion and distribution of A/g as grounds for constructing 
a single-step Markov chain model of {A/g}‘^g that we 
believe captures most of the relevant features. 


B. One-step Markov chain model 

To obtain an approximate rate function, we assume 
that after an initial transient, {A/g}g^g is described by 
a stationary one-step Markov chain taking values in a 
continuous range. We define a finite state space A by 
coarse-graining this range and replacing A/g by its ex¬ 
pected value in each bin. The transition matrix for this 
process is then obtained by binning data from a scatter 
plot such as Fig. 5 into these states and normalising the 
number of counts in each bin. To count the information 




















FIG. 7: The large deviation rate function (22) for 
L = 10, p = 0.5, r = 0.9. Lines show rate functions 
obtained from the Markov approximation with different 
numbers of bins. The lines for 12 and 24 bins are not 
distinguishable at this scale. Points represent data 
sampled at different finishing times t. The cutoff for 
positive deviations is explained in the text and given in 
Eq. (36), and is represented by a vertical line. 


gain, the new Markov matrix on the state space I is then 
weighted with the value of A/g in the target state. The 
SCGF (and thus the rate function) can then be obtained 
from the largest eigenvalue of this tilted transition ma¬ 
trix. 

To check the method it can be shown that for L = 2, as 
the number of bins is increased and more data is used in 
the scatter plot, the method converges to the analytically 
obtained rate function for that case. Fig. 7 shows the rate 
function obtained through this method for L = 10 plot¬ 
ted alongside data obtained from simulation. The figure 
shows convergence of the estimated rate functions with 
increasing number of bins, which appears to be consistent 
with the data. This confirms the validity of the one-step 
Markov approximation for a wide range of fluctuations. 
However, as can be seen in Fig. 7, the data indicate a cut¬ 
off in the rate function that the Markov approximation 
does not predict. In the next subsection we explain this 
feature by noting that large fluctuations of A/g do not 
accumulate in the way that the Markov model allows. 


C. Beyond Markovian analysis 

To correct the numerically obtained rate functions 
shown by solid lines in Fig. 7 we must consider the maxi¬ 
mum possible value for It/t. Fig. 5 suggests that it is pos¬ 
sible to obtain large amounts of information on consecu¬ 
tive time-steps (the top right corner of this scatter-plot 
has a small but non-zero population). However, investi¬ 
gation of individual trajectories of the system reveals that 
consecutive large positive amounts can only be obtained 
after consecutive large negative amounts (see App. A). 
This is not reflected in the ACF in Fig. 6 owing to the 


fact that these events occur very rarely, as can be seen 
from the marginal histograms in Fig. 5. 

The maximum value of It/t is obtained by measuring 
correctly every time-step. This maximum value is given 
by 

J" r 

(36) 

t a 

where a is numerically obtained from the My matrices 
for that system. A discussion is included in App. A. 

Fig. 7 shows the cut-off value for an unbiased system 
with a black vertical line. Unlikely trajectories in biased 
systems that always step against the bias can generate 
large positive deviations of It/t and so the rate function 
cuts off at higher values. The cut-off is therefore less 
relevant when comparing data for biased systems to the 
predicted rate function. 

The A/g process is clearly not a one-step Markov chain, 
and the rate function obtained this way is also limited by 
finite sampling of transitions and limitations on the num¬ 
ber of bins used. However, the one-step Markov model 
gives a rate function that converges reasonably quickly 
with increasing number of data points and together with 
the cut-off, captures well the shape of the sampled data 
in a way that a Gaussian or i.i.d. approximation would 
not. An n-step Markov chain model might also capture 
this but it appears that simply including the cut-off at 
the maximum value is sufficient to obtain a good approx¬ 
imate rate function. 


VII. DISCUSSION 

The information gain It is a quantity recently intro¬ 
duced in the analysis of feedback systems [34, 38] and 
studied as a component in the development of informa¬ 
tion thermodynamics and information engines [39]. The 
fluctuation properties of this quantity are relevant when 
considering information processing in feedback devices; 
the quantity of information gained is directly propor¬ 
tional to the work required to delete that information 
from the feedback device’s memory. 

In this paper we have studied a simple model of an 
information engine and obtained an exact analytical ex¬ 
pression of the large deviation rate function for informa¬ 
tion gain It in a two-site system. For larger systems we 
have shown that a one-step Markov approximation cap¬ 
tures most of the relevant details of the large deviations. 
We are also able to predict the cut-off of this rate func¬ 
tion by considering the maximum amount of information 
that can be obtained. 

Significantly, the one-step Markov approximation al¬ 
lows us to easily obtain an approximation of the rate 
function from data by sampling the information gain at 
consecutive time steps. This is computationally easier 
than directly sampling the distribution of It/t (which re¬ 
quires very long times or cloning-type algorithms [40]) 
but together with the theoretically predicted cut-off 
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seems to provide a consistent estimate of the shape of 
the large deviation rate function. 

The rate functions obtained here demonstrate that the 
information gained by the measuring device in a simple 
Markovian feedback system shows a strong asymmetry 
around the mean. The cut-off value for this rate func¬ 
tion is sensitive to the dynamics of the system, namely 
whether the particle motion is symmetric or asymmetric. 
It would be of interest to study other information engines 
to check whether these findings are generic and to what 
extent the Markov approximation is applicable in other 
systems. The large deviation rate function offers the pos¬ 
sibility to explore detailed fluctuation relationships be¬ 
yond (2) for It- To obtain a detailed fluctuation relation¬ 
ship, care must be taken in deciding how to meaningfully 
time-reverse a feedback system. Discussions have already 
alluded [34, 41] to the potential difficulties in interpreting 
time-reversed feedback in a physically meaningful man¬ 
ner and there is evidently much scope for future work. 
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Appendix A: Trajectory Analysis 

To understand the various events that occur in the 
system, we plot trajectories of Alg, P{ys-i)/P{ys), Xg 
and Yg. Recall that Alg is given by Eq. (8) (reproduced 
here for readability) 

Alg = lnp{yg I Xg) +ln ^^^~|\ 

and hence Alg will always differ from lnP(ys_i)/P(ys) 
by In r or In re depending on whether the measurement is 
correct or incorrect. 

The value of P(ys-i)/P(ys) is determined by the mea¬ 
surement trajectory itself and is not directly dependent 
on the trajectory x^. We observe that there are two 
‘baseline’ values for this ratio that are obtained when 
the measurements represent a possible trajectory for the 
particle, i.e. the particle does not appear to jump more 
than one site in a single time-step and does not appear 
to move ‘through’ the barrier. These two baseline values 
correspond to the particle jumping up or being blocked 
attempting to jump down. 

If the particle is blocked at site x for successive steps 
starting, for example, at time s —3, the probability of the 
correct measurement history is calculated via a matrix 


product as in Eq. (12), 

P{yg) = {Vg.3\MyMyMy\l), (Al) 

where y = x and where (ug-a] = ^Vk- 

ratio P(ys-i)/P(ys) entering the information gain Alg 
(Eq. (13)) is then dominated by the leading eigenvalue 
of the matrices, and therefore should approach 
I/A^ax very quickly. Recall that all matrices have the 
same eigenvalues since they are related by translations 
of rows and columns. The corresponding lower baseline 
value for Alg is given by 

A/^ = ln^. (A2) 

^max 

On the other hand, let us assume that the particle 
jumps up for successive time steps, starting in site x at 
time s — 3, and we measure this correctly. Then the 
probability of the measurement history P{yg) is given by 
a product of matrices with increasing index 

P{ys) = {Vg-sl MyMy^lMy^2\^) 1 (^ 3 ) 

where y = x, and y -I- n is understood with periodic 
boundary conditions. The successive matrices are trans¬ 
lated by one column and one row, i.e. My+i = TMyT~^, 
where the translation T is such that 

i{v\T),^ = Vk+i and {T\w))f^ = Wk-i- (A4) 

Similar to the eigenvalue case, the expression (A3) then 
is dominated by vectors (uj and a scale factor a such that 

((^1 = “W+i, (A5) 

with periodic boundaries (i.e. {{v\My)^ = avi,)- So the 
upper baseline value for information gain Alg should be 
given by 

A/“ = ln-, (A6) 

a 

where a can be found numerically from (A5) for any given 
system. 

Figs. 8 and 9 show empirical data confirming our pre¬ 
dictions for the baseline values and demonstrate their 
scaling with L in unbiased and biased systems, re¬ 
spectively. In an unbiased system, the difference in 
P(yg_i)/P(ys) between an up and blocked down jump 
shrinks with increasing P, whereas the values are roughly 
constant for biased systems. Fig. 10 shows a typical sec¬ 
tion of a trajectory in a biased system. The baseline val¬ 
ues of P(ys-i)/P(yg) and Alg for blocked down and up 
jumps are seen around t = 310 and t = 315 respectively. 

Whenever a measurement is made that is incompati¬ 
ble with previous measurements (i.e. the particle looks 
to have jumped two or more sites or moved through the 
barrier), P(ys_i)/P(yg) changes value. Isolated incor¬ 
rect measurements as seen in Fig. 10 (at time t = 307) 
and Fig. 11 (at time t = 44) cause P(ys-i)/P(ys) to 
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FIG. 8: Baseline values for an unbiased system {p = q) 
for r = 0.9. Points show the value of A/g for 
consecutive up jumps and blocked down jumps. Lines 
show the predictions from (A2) and (A6). 



FIG. 9: Baseline values for a biased system, p = 0.2 and 
r = 0.9. Points show the value of Alg for consecutive 
up jumps and blocked down jumps. Lines show the 
predictions from (A2) and (A6). 


increase while we observe that the In w term means that 
Alg = 0. In the following measurements, P{ys-i)/P{ys) 
is still larger than its baseline value and as the Inr con¬ 
tribution is small in comparison, Alg is also larger than 
the baseline. 

The information theoretic interpretation of these ob¬ 
servations is that upon making a measurement that is not 
compatible with the previous measurements, no new in¬ 
formation is gained. This is because it is not clear to the 
observer whether the current measurement is incorrect, 
or the previous measurements were erroneous (or both). 
It is only on subsequent measurements that information 
is gained, as more measurements allow the observer to 
make inferences about which measurements were incor¬ 
rect. After an incorrect measurement, P{ys-i)/P{ys) 
(and hence A/g) returns quickly to a baseline value. 

To observe very large values of Alg and instanta¬ 
neously gain large amounts of information, it is neces¬ 
sary to first lose larger amounts of information through 
incorrect measurements. An example of this is shown in 



FIG. 10: Trajectory of a biased system for L = 10, 
p = 0.2, r = 0.9. The trajectory Xg is given by a gold 
line, and measurements by square boxes. The 
information gain Alg, given by a blue line, is always 
bounded above by ln(P(ys_i)/P(ys)) given by a red 
line. Note that these quantities have different units. 
Note also that for Xg and yg there are periodic 
boundary conditions between 0 and 10. 



FIG. II: Trajectory of a biased system for L = 10, 
p = 0.5, r = 0.9. See Fig. 10 for details of the plot. 


Fig. II, where the large amounts of information gained 
between t = 57 and t = 59 cannot balance the losses be¬ 
tween t = 53 and t = 55. Hence it is not possible to gain 
additional information by making strategically ‘wrong’ 
measurements, as a series of correct measurements would 
yield more total information. 

In the case of a wrong measurement that still 
represents a possible trajectory for the particle, 
P{yg-i)/P{yg) does not change but the Inw contribu¬ 
tion from the incorrect measurement means that Alg 
takes a negative value. If the following measurements 
are correct and also compatible with the previous wrong 
measurement, then subsequent measurements will only 
gain a baseline amount of information. An example of 
this is shown in Fig. 12. Here, a wrong measurement 
occurs at time t = 50 which is compatible with the pre- 
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FIG. 12: Trajectory of an unbiased system for L = 10, 
p = 0.5, r = 0.9. See Fig. 10 for details of the plot. 


vious history. Subsequent measurements do not allow 
the observer to ascertain that any of previous measure¬ 
ments were incorrect and hence this information loss is 
not recovered. 


Appendix B: Independence of AG for L = 2 

To demonstrate that AG are i.i.d. random variables for 
L = 2, we want to show that the distribution of AG_|_i 
is independent of AG and identical for all s. Let us first 
assume that the system is in the state {xs,ys) = (1,1)- 
The probabilities for AG-i-i taking the values in (29) and 
(30) are as follows: 


AG+1 

Transition 

Probability 

a 

(1,1) ^(2,2) 

pr 

b 

(1,1) ^(1,1) 

qr 

c 

(1,1) ^(1,2) 

qw 

d 

(1,1) ^(2,1) 

pw 


By the translation invariance in the system, the same 
probabilities apply if the system was in state {xs,ys) = 
( 2 , 2 ). 

If the system starts in the state {xg, ys) = (1, 2), then 
the probabilities are: 


AG+1 

Transition 

Probability 

a 

(1,2) ^(1,1) 

pr 

b 

(1,2) ^(2,2) 

qr 

c 

(1,2) ^(2,1) 

qw 

d 

(1,2) ^(1,2) 

pw 


which again by translation invariance also holds for be¬ 
ginning in the state (2,1). We can see then that the 


probability to obtain AG-i-i is in fact independent of the 
state {xs,ys) of the system. In particular, it does not 
depend on the previous value AG- Therefore, {AG}‘=o 



FIG. 13: Points show numerical results for Max(AG) 
and — Min(AG) (from 10^ trajectory realisations of 
length t = 500) and predicted values given by (Cl) 
and (C2) for varying L for p = 0.5, r = 0.9. 


Appendix C: Maximum and Minimum of AG 

Fig. 13 shows numerical results for the maximum and 
minimum of AG from 10^ realisations up to t = 1000 
for varying L. The minimum amount of information per 
time-step is obtained when an incorrect measurement 
is made and we argue that, as in the L = 2 case (see 
Eq. (30)) its value is given by 

Min(AG) = ln^. (Cl) 


The maximum value is observed to be exactly the min¬ 
imum value reflected across the lower baseline value for 
information gain (A2), i.e. it is given by, 

Max(AG) = In -Min(AG) = In —. (C2) 


This holds for all L > 4 as shown by the numerical re¬ 
sults in Fig. 13. For G < 3, the observed discrepancy is 
probably due to the fact that incorrect measurements are 
more constrained e.g. any barrier placement in a two-site 
system will interfere with the particle motion. 
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